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Description 

BACKGROUND OF THE INVENTION 

[0001] The present invention relates to a semiconduc- 5 
tor integrated circuit such as logical LSI, microcomputer 
or the like. More particularly, the present invention 
relates to a novel function apparatus (or module) which 
improves the speed of a symbol string search process- 
ing, a single-chip microcomputer which includes such 
an apparatus (or module) and a system which uses 
such a microcomputer. 

[0002] As the amount of document data handled by an 
information processing equipment increases, demands 
for high-speed data search became enlarged. In a full 
text search in which the search of text data is made by 
use of not an index but an arbitrarily set keyword, it is 
important to make at a high speed a so-called string 
search or a character string search which makes a 
search for a keyword existing in text data. 
[0003] A high-speed algorithm for search of text data 
for a plurality of keywords has hitherto been known and 
is realized by a software on a general purpose proces- 
sor. However, it is difficult to ensure a practical speed in 
the search of a large-scale database. Recently, there 
are proposed high-speed techniques which use special 
purpose hardwares in order to obtain a sufficient search 
speed. One example of such techniques is disclosed in 
JP-A-64-42784 entitled "CHARACTER STRING COM- 
PARE APPARATUS AND HIERARCHICAL CHARAC- 
TER STRING COMPARE SYSTEM IN THE 
APPARATUS". 

[0004] The prior art disclosed by the JP-A-64-42784 
concerns a special purpose LSI for the use for character 
string search. The LSI includes a memory area for reg- 
istering keywords therein and a logical circuit area for 
performing a search processing by comparing the key- 
words and text data character by character. The plurality 
of keywords are registered in the memory area and the 
text data is searched for those keywords. The number of 
keywords and the keyword length, which can be set 
simultaneously, are restricted by the size of the memory 
area. The above prior art teaches a method and means 
by which many keywords can be registered by saving 
the memory area. 

[0005] More especially, each keyword is symbolized 
by hierarchical division thereof into short character 
strings. In the case where a plurality of analogous com- 
posite words are set as keywords, that is, in the case 
where the same character string pattern appears in a 
plurality of keywords, it is possible to efficiently utilize 
the memory area since a divisional character string pat- 
tern can be used in common. Accordingly, the number 
of keywords increases which can be registered. How- 
ever, since the hierarchical symbolization is made, a 
procedure for making a matching with the original key- 
word becomes necessary. This causes the increase in a 
processing time. Also, since the overall search process- 
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ing is performed by the LSI for character string search, 
there is involved a problem that the upper limit of the 
scale of a circuit, which can be formed with LSI configu- 
ration, restricts a function and the number of keywords 
capable of being processed and hence only the charac- 
ter string search within the restricted range is possible. 
[0006] As mentioned, the above prior art has the prob- 
lem that the search function and the number of key- 
words capable of being simultaneously searched out 
are restricted by the scale of the hardware. In a charac- 
ter string search, there may frequently be required a so- 
called approximate search function, that is, a function of 
searching text data even for keywords which do not 
exactly match with a desired keyword. Therefore, the 
circuit scale has a tendency to further increase. How- 
ever, if the circuit scale becomes too large so that the 
cricuit must be LSI-configured on a plurality of chips, the 
merit of the LSI configuration is decreased since it is not 
possible to make the best use of the high-speed ability. 
Further, the use of the special purpose hardware 
requires a host CPU for controlling this hardware. This 
also causes the increase in the number of chips. 
[0007] GB-A-2 200 228 discloses a computer includ- 
ing a central processing unit which is connected via 
buses to a read-only memory, a random access mem- 
ory, input/output units and a content addressable mem- 
ory, wherein the content addressable memory may be 
instructed to record data to seek for data matching an 
input group or to enable data to be read from a particu- 
lar address of the memory. 

[0008] I EE Proceedings-E. Computers & Digital Tech- 
niques, vol. 136, no. 5, September 1989, pages 351 to 
356 describes different layouts for content addressable 
mass memories which allows to select words, the con- 
tent of which is identical to an argument offered as an 
address and stored in a special search register. 
[0009] Elektronische Rechenanlagen, vol. 26, no. 4, 
1984, pages 179 to 185, discloses content addressable 
memories used for time critical search processing. 

SUMMARY OF THE INVENTION 

[0010] An object of the present invention is to provide 
a data processing apparatus which performs simple 
string search at high speed and with a hardware scale 
smaller than that in the prior art. 

[001 1] This object is met by a string search apparatus 
according to claim 1. Preferred embodiments are dis- 
closed in the dependent claims. 

[0012] In the present invention, an apparatus to 
improve the speed of a symbol string search such as a 
character string search (hereinafter referred to as a 
search module or the like) is provided. A CPU and the 
search module are bus-connected through generalized 
or universal address and data buses and take over their 
portions of a symbol string (or character string) search 
processing. 

[0013] More particularly, the search module is con- 
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nected to the CPU through the address bus and the 
data bus and includes function definition means for 
defining a funtion of the search module in accordance 
with a command from the CPU, data input/output 
means for receiving a symbol string to be searched 5 
through the data bus and outputting the result of search, 
search processing means for performing a search 
processing based on a function defined by the function 
definition means by use of a symbol string to be 
searched out which is stored in the search processing 
means itself and the symbol string to be searched which 
is inputted to the data input/output means, and condition 
holding means for holding data indicative of an internal 
condition corresponding to the result of the search 
processing. 

[0014] In the present invention, a software processing 
on the CPU is fundamental in order to perform a symbol 
string (or character string) search or the like at a high 
speed. A portion of that processing is performed by the 
search module at a high speed, thereby improving the 
speed of the overall symbol string (or character string) 
search processing. 

[0015] Generally, a symbol string (or character string) 
search processing includes making successive compar- 
ison or collation with text data from the top or the last of 
a symbol string (or character string). If a mismatching is 
found out in the course of comparing, the on-set posi- 
tion of the text data is shifted and the comparision is 
resumed from the compare start position of the symbol 
string (or character string), that is, the top or the last of 
the symbol string (or character string). Therefore, the 
frequency of compare processing at the compare start 
portion of the symbol string (or character string) 
becomes very high. The processing at this high-fre- 
quency portion is performed by the search module at a 
high speed and the other low-frequency portion is soft- 
ware-processed by the CPU. Since the high-frequency 
portion to be processed at a high speed is restricted to 
the vicinity of the compare start position of the symbol 
string (or character string), a relatively small scale cir- 
cuit suffices for the search module. Therefore, it is also 
possible to integrate the search module and the CPU on 
the same LSI chip. Thereby, it becomes possible to pro- 
vide means which searches for a symbol string (or char- 
acter string) at a high speed and with a hardware scale 
smaller than that in the prior art. 
[0016] In the present invention, the CPU and the 
search module are connected through generalized 
buses (or the address bus and the data bus) and data to 
be beforehand set in the search module is written by 
CPU. The execution of a symbol string (or character 
string) search processing is activated by transferring the 
text data to the search module. The result of comparing 
is outputted to a result storage in the data input/output 
means of the search module and is read therefrom and 
interpreted by the CPU. 

[0017] The CPU and the search module are not con- 
nected by a special data bus but by a generalized bus. 



4 

Observed from the CPU, the search module looks as 
being mapped in memory address map. Therefore, the 
execution of a search processing and the acquisition of 
the result can be made easily in a procedure similar to 
the procedure of data transfer for a memory. Mapping at 
any address is possible. Therefore, in the case where 
the configuration of a single-chip microcomputer is 
employed, good matchability with another module is 
obtained. Also, if the CPU and the search module is 
integrated as a search processor on the same semicon- 
ductor chip, the speed of data transfer between the CPU 
and the search module is improved. Further, it becomes 
possible to structure an application system with a hard- 
ware scale smaller than that in the prior art. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0018] 

Fig. 1 is a block diagram for explaining the construc- 
tion of a first embodiment of the present invention; 
Fig. 2 is a block diagram for explaining the case 
where the construction of the first embodiment of 
the present invention is applied to another purpose; 
Fig. 3 is a block diagram for explaining a more 
detailed embodiment of a search module shown in 
Fig. 1; 

Fig. 4 is a diagram for explaining the address map- 
ping of memories and control registers in the 
search module; 

Fig. 5 is a diagram for explaining the formats of 
commands which define functions of the search 
module; 

Fig. 6 is a block diagram for explaining a more 

detailed embodiment of data input/output means of 

the search module shown in Fig. 3; 

Fig. 7 is a block diagram for explaining a more 

detailed embodiment of a path selector of the 

search module shown in Fig. 3; 

Fig. 8 is a diagram for explaining the format of a 

compare control memory of the search module 

shown in Fig. 3; 

Fig. 9 is a block diagram for explaining an embodi- 
ment of a content addressable memory of the 
search module shown in Fig. 3; 
Fig. 10 shows a true value table for explaining the 
function of a compare logic for producing the result 
of matching in consideration of a compare condi- 
tion; 

Fig. 1 1 is a block diagram for explaining an embod- 
iment of a compare result analyzing circuit of the 
search module shown in Fig. 3; 
Fig. 12 is a block diagram for explaining an embod- 
iment of a plural-hit analyzing block in the compare 
result analyzing circuit shown in Fig. 1 1 ; 
Fig. 13 is a diagram for explaining the format of data 
outputted to an output register of the data input/out- 
put means of the search module shown in Fig. 3; 
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Fig. 14 is a diagram for explaining the format of a 
condition code register of a control section of the 
search module shown in Fig. 3; 
Fig. 15 is a diagram for explaining an embodiment 
of input and output signals of the search module; 5 
Fig. 16 is a block diagram for explaining an embod- 
iment of a search processor in which the construc- 
tion of the first embodiment of the present invention 
is integrated on one chip; 

Fig. 17 is a block diagram for explaining an embod- 10 
iment of a magnetic disk system which has the 
search processor of Fig. 16 incorporated therein 
and is therefore provided with a character string 
search function; 

Figs. 18A and 18B are diagrams for comparatively 15 
explaining the operation of the conventional mag- 
netic disk system and the operation of the inventive 
magnetic disk system of Fig. 17 provided with the 
character string search function, respectively; and 
Fig. 1 9 is a block diagram for explaining a host CPU 20 
system which includes an interface for the magnetic 
disk system of Fig. 17 having the character string 
search function. 



DESCRIPTION OF THE PREFERRED EMBODI- 25 
MENTS 

[0019] Fig. 1 shows a block diagram of an embodi- 
ment of the present invention. A CPU 1 00 and a search 
module 200, which is a multi-function module, are con- 30 
nected through generalized address and data buses. 
The search module 200 includes data input/output 
means 202 for data transfer from/to the CPU 100, func- 
tion definition means 203 for defining the function and 
operation of the module, condition holding means 204 35 
for reflecting or holding the result of a processing in the 
search module, and search processing means 201 the 
function of which is defined by the function definition 
means 203 and which takes in necessary data from the 
data input/output means 202 and delivers, to the data ao 
input/output means 202, data to be outputted as the 
result of a search processing performed. These compo- 
nents are connected to the data bus and can be 
accessed by addressing from the CPU 100. The opera- 
tion of each component will now be explained. 45 
[0020] A character string(s) or a partial character 
string(s) is beforehand prepared by a preprocessing 
from a keyword(s) to be searched out and is set into the 
search processing means 201 of the search module 
200. A mode, which the search module 200 may take, is so 
roughly classified into two modes (or a data set mode 
and a search mode), depending on a function defined. 
In this case, the search module 200 is in the data set 
mode. Next, a command is written from the CPU 100 
into the function definition means 203 in order to select 55 
a necessary one from among a plurality of functions of 
the search processing means. When a character string 
search command is written into the function definition 



6 

means 203, the search module 200 turns into a search 
mode and is thereby ready for a search processing. 
[0021] The search processing is activated by the write 
of one word of text data (or character data to be 
searched) into the data input/output means 202. 
Namely, CPU 100 successively writes the text data into 
the data input/output means 202 of the search module 
200. The text data is sent from the input/output means 
202 to the search processing means 201 in which it is 
compared with all the beforehand set character strings 
(or character strings to be searched out). This process- 
ing is called a leading collation processing. If there is 
any character string which matches with the text data, 
data corresponding to that character string is outputted 
to the data input/output means 202. Even in the case 
where there is no character string which matches with 
the text data, data giving notice of the mismatching is 
outputted to the data input/output means 202. The CPU 
100 makes access to the search module 200 to read 
and interpret data outputted to the data input/output 
means 202. The CPU 100 determines the next opera- 
tion in accordance with the result of interpretation. 
Namely, if the character strings set in the search module 
include one which matches with the inputted text data, 
i.e. the matching with a compare start portion of a cer- 
tain keyword. Accordingly, the CPU 100 makes a com- 
parision with the succeeding character strings of that 
keyword. This processing is called a posterior collation 
processing. On the other hand, in the case where there 
is no character string which matches with the inputted 
text data, the CPU 100 performs no posterior collation 
processing or immediately writes the next text data into 
the data input/output means 202. The search module 
not only outputs the result of the search processing to 
the data input/output means 202 but also reflects the 
internal condition of the module and the result of search 
to the condition holding means 204. Since the content of 
the condition holding means 204 is directly connected to 
the CPU 100, it is also possible for the CPU to deter- 
mine the next operation from that information. 
[0022] The function of the search processing means 
201 may include the designation of an exact search or 
an approximate search, that is, a search allowing an 
erroneous character(s), the designation of an anchor 
matching search or a non-anchor matching search, the 
designation of a character code length (that is, the des- 
ignation of a one-byte code or a two-byte code), and so 
on. 

[0023] In the conventional arithmetic module, a set of 
an operand and a command as required are transferred 
from a CPU. In the present invention, the CPU 100 
firstly writes a command into the function definition 
means 203 of the search module to define one of the 
combinations of the above-mentioned functions of 
search. Next, the CPU 100 writes the text data as an 
operand into the data input/output means 202. Since a 
search processing is activated by the write of the data, 
the succeeding comparision of the text data with the 
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character string(s) set in the search module can be 
made through the repetition of the write of data from the 
CPU 100 into the search module 200 and the read of 
data from the search module to the CPU 100. This 
processing can be performed in a procedure similar to 5 
the procedure of access from a CPU to a memory. 
[0024] As apparent from the above, according to the 
embodiment of the present invention, it becomes easily 
possible to make the execution of a search processing 
and the acquisition of the result in a procedure similar to 10 
the procedure of data transfer for a memory. Also, the 
search module can be mapped at any address. There- 
fore, in the case where the configuration of a single-chip 
microcomputer is employed, good matchability with 
another module is obtained. Further, since the search 15 
module performs only a leading collation processing, it 
is possible to realize the search module with a small- 
scale hardware. Therefore, the CPU and the search 
module can be integrated as a search processor on the 
same semiconductor chip, thereby improving the speed 20 
of data transfer between the CPU and the search mod- 
ule. Thus, there is obtained an effect that a character 
string search processing by a single-chip microcom- 
puter can be performed at a much higher speed as com- 
pared with that by the conventional apparatus. 25 
[0025] The construction of the embodiment shown in 
Fig. 1 is applicable to the case where a multi-function 
arithmetic module other than the search module is 
used, which is not covered by the present invention One 
example of such a case is shown in Fig. 2. In the exam- 30 
pie shown in Fig. 2, the construction of the embodiment 
shown in Fig. 1 is applied to a floating point arithmetic 
module 210. The floating point arithmetic module 210 
includes floating point arithmetic means 211, data 
input/output means 202, function definition means 203 35 
and condition holding means 204. 
[0026] Firstly, a command is written from a CPU 100 
into the function definition means 203 in order to define 
a function of the floating point arithmetic means 211. 
Thereby, the function of the floating point arithmetic 40 
module 210 is defined to be, for example, multiplication, 
division, sum-of-products arithmetic, or the like. Next, 
the CPU 100 writes a necessary operand(s) into the 
data input/output means 202. When a required number 
of operands for the defined arithmetic become com- 45 
plete, an arithmetic processing routine is activated. 
After the arithmetic processing, the result of arithmetic 
is written into the data input/output means 202 and the 
internal condition of the module and a condition code is 
written into the condition holding means 204. The CPU so 
100 can fetch the result of arithmetic processing by 
reading those information. In the case where an arith- 
metic processing is to be continuously executed with the 
same function, the arithmetic processing can be per- 
formed continuously by writing operands into the data 55 
input/output means 202 and reading the results of arith- 
metic processing. Namely, in the conventional method, 
a command is transferred each time one arithmetic 



processing is executed. On the other hand, in the 
present invention, it is sufficient to write commands into 
the function definition means 203 at once prior to the 
continuous execution of arithmetic processing. There- 
fore, the arithmetic processing speed is improved. 
[0027] By applying the construction of the present 
invention shown in Fig. 1 to a multi-function arithmetic 
processing module as in the above embodiment, there 
is obtained an effect that the processing speed of a sin- 
gle-chip microcomputer including the multi-function 
arithmetic processing module is improved. 
[0028] Fig. 3 shows a more detailed embodiment of 
the search module 200 according to the present inven- 
tion. The module 200 includes a data input/output sec- 
tion 300, a search processing section 400 and a control 
section 800. Though not strictly correspond to the con- 
struction shown in Fig. 1 , the data input/output section 
300 corresponds to the data input/output means 202, 
the search processing section 400 corresponds to the 
search processing means 200, and the control section 
800 corresponds to the function definition means 203 
and the condition holding means 204. Each section is 
connected to a data bus through an internal bus 220 of 
the module 200. The bit width of a data line in the draw- 
ings is 16 bits, excepting the case where a specified 
designation is given. However the present embodiment 
can also be realized by using a data line having another 
different bit width. 

[0029] The data input/output section 300 is composed 
of three data input registers (DIR0) 310, (DIR1) 320 and 
(DIR2) 330 and one data output register (DOR) 340. 
The data input register has a data shifting function. Data 
written in the data input register (DIR2) 330 is succes- 
sively shifted to the data input register (DIR1) 320 and 
then to the data input register (DIR0) 310. Accordingly, 
the successive writing of data into the data input register 
(DIR2) 330 from a CPU suffices to cause the flow of 
data between the data input registers. The contents of 
the data input registers (DIR0) 310 and (DIR1) 320 are 
delivered as data of 32 consecutive or connected bits to 
the search processing section 400. The data output reg- 
ister (DOR) 340 receives and stores the result of search 
from the search processing section 400. The data input 
and output registers 310 to 340 are connected to the 
internal bus 220 so that direct access to them from the 
CPU is possible. 

[0030] The search processing section 400 is com- 
posed of a mask register (MSK0, MSK1) 410 for mask- 
ing (or designating a mask to) each bit of data sent from 
the data input/output section 300, a path selector (PATH 
SEL) 420 for making a positional interchange of data in 
units of a byte, a content addressable memory (CAM) 
500 for storing a character string(s) or a partial charac- 
ter string(s) to be searched out, a compare control 
memory (CCM) 600 for storing a condition for compari- 
sion in the content addressable memory 500, a com- 
pare result analyzing circuit 700 for analyzing the 
number of words matched from the result of comparing 
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in the content addressable memory 500, a priority 
encoder (PE) 430 for outputting matching signals of the 
words matched one by one, a RAM 440 for storing out- 
put data, and a non-hit register (NHR) 450 for storing 
output data when no matching is obtained. 5 
[0031] The content addressable memory 500 
receives, as inputs, data from the data input/output sec- 
tion 300 through the mask register 410 and the path 
selector 420 and the content of the compare control 
memory 600. (Details thereof will be mentioned later on 
by use of Fig. 9.) The result of compare is transferred to 
the compare result analyzing circuit 700 and to the pri- 
ority encoder 430. An output of the priority encoder 430 
directly becomes an address of the RAM 440. Thereby, 
the content of the RAM 440 is read. The read result is 
transferred to the data input/output section 300 and is 
held by the data output register (DOR) 340 of the data 
input/output section 300. On the other hand, the result 
of analysis by the compare result analyzing circuit 700 
is transferred to the control section 800. Thereby, the 
selection is made of whether data to be set into the data 
output register (DOR) 340 is the data of the non-hit reg- 
ister 450 or the content of the RAM 440. 
[0032] The control section 800 is composed of a con- 
dition code register (CCR) 810 as condition holding 
means, a function definition register (FDR) 820 as func- 
tion definition means, and a control circuit 830. The 
function definition register 820 and the condition code 
register 810 are connected to the internal bus of the 
search module 200. The function definition register 820 
transfers the content thereof to the control circuit 830 to 
make a control for a processing sequence. On the other 
hand, the condition code register 810 receives data 
indicative of the condition of the search module during 
processing or after processing from the control circuit 
830. This data is also delivered to the CPU directly by a 
condition code signal line 230. The control circuit 830 
controls the data transfer and the execution of process- 
ings in the parts of the data input/output section 300 and 
the search processing section 400. Control lines for this 
purpose are omitted from Fig. 3. 

[0033] The above-mentioned memory and various 
control registers incorporated in the search module 200 
are address-mapped and can be accessed from the 
CPU by addressing. An address bus is omitted from Fig. 
3. 

[0034] An address map is shown in Fig. 4. In the 
present embodiment, the content addressable memory 
(CAM), the RAM, the compare control memory (CCM) 
and each control register are constructed with 32 bits x 
32 words, 16 bits x 32 words, 1 1 bits x 32 words and 16 
bits, respectively. In the search module, the memories 
and the control registers have sequential address 
assignment from the 0-th address in individual areas. A 
change-over between the memory area and the control 
register area is made by a selection signal for selecting 
each area. Upon access, an address signal and an area 
selection signal are combined. 
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[0035] Though the compare control memory 600 and 
the content addressable memory 500 are assigned to 
the same address of the same area, an access change- 
over is made in accordance with the definition of the 
function of the search module 200 as will be explained 
later on. Namely, access to the compare control mem- 
ory area becomes possible only in the case where the 
access to the compare control memory area is defined 
for the function definition register 820. In usual, the 
selection of a memory area results in access to the con- 
tent addressable memory area. In this manner, the 
memory area of the search module 200 can take a con- 
struction similar to that of a usual memory. Thereby, 
there is obtained an effect that mapping at any address 
on a memory map is possible in the system level. The 
present embodiment can also be realized by use of 
another bit/word construction. 

[0036] Next, the operation of the present embodiment 
will be explained. 

[0037] A character string search is a processing for 
searching a text (which is a character string to be 
searched) for a keyword (which is a character string to 
be searched out). It is assumed that a character(s) to be 
set into the content addressable memory area of the 
search module and data to be set into the RAM area 
corresponding thereto are beforehand determined from 
a given keyword(s). 

[0038] Firstly, as preparation for a search processing, 
data are set into the content addressable memory area 
and the RAM area of the search module 200. Also, a 
command to define access to the compare control 
memory area is set into the function definition register 
820 and data is set to the compare control memory 
area. Thereafter, a command to define a character 
string search is set into the function definition register 
820. The search module 200 turns into a search mode 
and is thereby ready for the search. Text data is written 
from the CPU into the data input registers (DIR0 - 2) 310 
to 330 to perform the search processing. 
[0039] The data input/output section 300 shown in Fig. 
3 transfers the contents of the data input registers 
(DIR0) 310 and (DIR1) 320 to the search processing 
section 400 in the form of data of 32 consecutive bits. 
For this purpose, there may be a method in which data 
directly written in the data input registers (DIR0) 310 
and (DIR1 ) 320 are transferred to the search processing 
section 400 as they are and a method in which data 
existing in the data input registers (DIR0) 310 and 
(DIR1) 320 are transferred to the search processing 
section 400 while data written in the input register 
(DIR2) 330 is successively shifted in a sequence of 
DIR2 -> DIR1 -> DIR0 in a state in which data are 
present in the data input registers (DIR0) 310 and 
(DIR1) 320. Either method can be designated by a 
specified field of the character string search command 
set into the function definition register 820. 
[0040] The 32-bit data transferred to the search 
processing section 400 is masked by the mask register 
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410 and is then delivered to the path selector 420. How- 
ever the data may not be subjected to any masking. The 
path selector 420 makes an interchange or replacement 
of the 32-bit data in bit position and in byte position. 
Thereby, an approximate search can be realized. But 5 
the bit position and the byte position may be kept as 
they are as well. The input data is transferred through 
the path selector 420 to the content addressable mem- 
ory 500. The input data is simultaneously compared 
with all the characters or data which are beforehand set w 
in the content addressable memory 500. In this case, 
referring to the content of the compare control memory 
600, the decision of a matching or a mismatching is 
made in accordance with the indicated condition. One 
word of the compare control memory 600 has a one-to- 15 
one correspondence to one word of the content 
addressable memory 500, and the compare control 
memory 600 designates a condition for comparision 
with data of the content addressable memory 500 in 
units of one byte. The compare condition may include a 20 
positive logic condition under which a matching is deter- 
minated when the set data and the input data are the 
same, a negative logic condition under which a match- 
ing is determined when they are different, a do not care 
condition under which a matching is always determined 25 
and an invalid condition under which a mismatching is 
always determined. In addition, the adoption/non-adop- 
tion of a search condition designated by the function 
definition register 820 can be designated individually. 
The matching/mismatching in the result of comparing 30 
with the input data is a logical AND of the results of com- 
paring for respective bytes. If words match, the corre- 
sponding contents of the RAM 440 are successively 
read by the priority encoder (PE) 430 and are set into 
the data output register (DOR) 340. From the result of 35 
comparing in the content addressable memory 500, the 
compare result analyzing circuit 700 makes a decision 
of the presence/absence of any word matched and a 
decision as to whether the number of words matched is 
one or plural. In the case where there is no matching 4 q 
word the content of the non-hit register (NHR) 450 is set 
into the data output register (DOR) 340. Information 
acquired by the compare result analyzing circuit 700 is 
sent to the control section 800 to be used for a 
sequence control of the search module 200. Also, the 45 
internal condition during the search processing is set 
into the condition code register (CCR) 810. The condi- 
tion code register (CCR) 810 is directly connected to the 
CPU by the dedicated bus 230 so that the CPU can 
refer to the CCR, as required. However the CCR can 50 
also be accessed through the generalized data bus. 
[0041] Reading the content of the data output register 
(DOR) 340, the CPU interpretes it and takes the next 
action. Namely, if the matching of the text data with the 
set character string occurs in the search module 200, i.e 55 
the matching at a compare start position of the keyword, 
the CPU starts a posterior collection processing. Irre- 
spective of the execution of the posterior collation 
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processing, the CPU successively writes the text data 
into the search module 200 so that a leading collation 
processing is always performed by the search module 
200. 

[0042] Fig. 5 shows the mnemonics and formats of 
commands set into the function definition register 820 in 
the present embodiment. The function of the search 
module 200 is defined by those commands. Each com- 
mand shown in Fig. 5 basically takes a construction of 
16 bits a word but it is of course that another construc- 
tion can be employed. Each command is as follows. 

1. NOP: NON-OPERATION — The search module 
200 performs nothing, and the content addressable 
memory area and the RAM area are made availa- 
ble as a usual memory area. 

2. CRST: CONTENT ADDRESSABLE MEMORY 
(CAM) RESET — A compare condition of the com- 
pare control memory 600 is set to the invalid condi- 
tion so that the content addressable memory area 
is substantially initialized. 

3. CACS: COMPARE CONTROL MEMORY (CCM) 
ACCESS — Access to the compare control mem- 
ory area is enabled in lieu of access to the content 
addressable memory area which is usually access- 
able. 

4. COMPFO: BIT PATTERN SEARCH — A search 
for a bit pattern is made. Data shift in the data input 
registers (DIR0 - 2) 310 to 330 is made in units of 
one bit. A masking operation is possible by referring 
to only the mask register. The COMPFO has the fol- 
lowing option field. 

SH: This designates the shift (ON/OFF) of 
input data. 

OUT0: This designates the output of an interrup- 
tion request signal upon matching detec- 
tion (INTERRUPTION REQUEST 
OUTPUT ON/OFF). 

OUT1: This designates the substitution (CODE 
MIX) or non-substitution (RAM OUT) of 
an internal condition code for a part of 
data outputted from the RAM to the DOR. 

MR: This designates the reference (ON/OFF) 
to the mask register. 

ME: In the case where multi-matching occurs, 
this field selects whether the advance to 
the next step should be taken after all the 
results of matching have been read from 
the data output register (DOR) 340 (ON) 
or after only the first result of matching 
has been read from the data output regis- 
ter (DOR) 340 (OFF). 

5. COMPF1: ONE-BYTE CODE CHARACTER 
STRING SEARCH — A search for a one-byte code 
character string is made. Data shift in the data input 
registers (DIR0 - 2) 310 to 330 is made in units of 
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one byte. A masking operation is designated by an 
option field. The COMPF1 has the following option 
fields in addition to SH, OUT, ME fields similar to 
those mentioned in conjunction with the COMPFO. 

AK: This selects either an anchor search 

(ON) or an non-anchor search (OFF). 

APXM3-0: These designate independently the 
executions (ON/OFF) of four kinds of 
approximate search sequence. 

BMSK3-0: These designates the setting 
(ON/OFF) of one-bit mask to the same 
bit position of each byte and that set- 
ting position. 

6. COMPF2: TWO-BYTE CODE CHARACTER 
STRING SEARCH — A search for a two-byte code 
character string is made. Data shift in the data input 
registers (DJRO - 2) 310 to 330 is made in units of 
two bytes. A masking operation is designated by an 
option field. The COMPF2 has option fields quite 
simitar to those of the COMPF1, excepting the fol- 
lowing BMSK fields. BMSK3-0: These designate 
the setting (ON/OFF) of a one-bit mask to the same 
bit position of bytes 0 and 1 and that setting posi- 
tion. 

7. COMPF3: TWO-BYTE CODE CHARACTER 
STRING SEARCH — A search for a two-byte code 
character string is made. Data shift in the data input 
registers (DIRO - 2) 310 to 330 is made in units of 
two bytes. A masking operation is designated by an 
option field. The COMPF3 has option fields quite 
similar to those of the COMPF1, excepting the fol- 
lowing BMSK fields. 

BMSK3-0: These designate the setting (ON/OFF) 
of a one-bit mask to the same bit posi- 
tion of bytes 1 and 3 and that setting 
position. 

[0043] Next, the constructions of the respective parts 
of the embodiment shown in Fig. 3 will be explained 
together with more detailed embodiments. 
[0044] Fig. 6 shows a more detailed embodiment of 
the data input/output section 300. Data input registers 
(DIRO) 310, (DIR1) 320 and (DIR2) 330 and a data out- 
put register (DOR) 340 are connected to an internal bus 
220 in the search module and are all accessable from 
the CPU. Buffer registers (SHR0) 312 and (SHR1) 322 
and selectors (SEL0) 311, (SEL1) 321 and (SEL2) 331 
for shifting are provided for data shift in the data input 
registers (DIRO - 2) 310 to 330. 

[0045] In the case of one-byte shift, the buffer register 
(SHR0) 312 forms the data input register (DIRO) 310 
after one-byte shift from an upper byte of the data input 
register (DIRO) 310 and a lower byte of the data input 
register (DIR1) 320, and the buffer register (SHR1) 322 
forms the data input register (DIR1) 320 after one-byte 
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shift from a lower byte of the data input register (DIR1) 
320 and an upper byte of the data input register (DIR2) 
330. In the case of two-byte shift, the buffer register 
(SHR0) 312 forms the data input register (DIR)) 310 

5 after two-byte shift from the data input register (DIR1) 
320, and the buffer register (SHR1) 322 forms the data 
input register (DIR1) 320 after two-byte shift from the 
data input register (DIR2) 330. In the case of one-bit 
shift, the buffer register (SHR0) 312 forms the data input 
register (DIRO) 310 after one-bit shift from the most sig- 
nificant bit of the data input register (DIR1) 320 and the 
data input register (DIRO) 310, and the buffer register 
(SHR1) 322 forms the data input register (DIR1) 320 
after one-bit shift from the most significant bit of the data 

15 input register (DIR2) 330 and the data input register 
(DIR1)320. 

[0046] Fig. 7 shows a more detailed embodiment of 
the path selector 420. In order to make an approximate 
search of text data for a character string set in the con- 

20 tent addressable memory area, the path selector 
includes selectors (SEL3 - 6) 421 to 424 for changing 
the byte position of four-byte data which is the contents 
of the DIRO and DIR1. For example, in the case of a 
search for a pattern including the interchange of two 

25 adjacent characters, the four-byte data is transferred to 
the content addressable memory area with the posi- 
tions of two any adjacent bytes among B0, B1, B2 and 
B3 being interchanged. 

[0047] The construction shown in Fig. 7 also has byte 

30 position changing functions for both one-byte data and 
two-byte data. The selection of either the function for 
one-byte data or the function for two-byte data is made 
by a command which is to be set into the function defi- 
nition register 820. 

35 [0048] Fig. 8 shows an embodiment of the format of 
the compare control memory 600. Each word of the 
compare control memory 600 has a one-to-one corre- 
spondence to each word of the content addressable 
memory 500. When the comparision of characters or 

40 data stored in the content addressable memory 500 and 
text data is made, the reference to each word of the 
compare control memory 600 is made as a compare 
condition for the corresponding word of the content 
addressable memory 500. One word is composed of the 

45 following subfields having meanings which are given 
herein below. 

DOC: This indicates whether the 

content of a corresponding 
so word of the content addressa- 

ble memory area is a delim- 
iter or a character. 
AS: In the case where a character 

string search command is set 
55 in the function definition regis- 

ter 820, this subfield indicated 
whether or not an approxi- 
mate search sequnce desig- 
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BM: 



BO -3: 



pstv (positive logic): 

ngtv (negative logic): 

dntc (don't care): 
invd (invalid): 



nated by the command 
should be executed. 
In the case where a character 
string search command is set 
in the function definition regis- 
ter 820, this subfield indicates 
whether or not the reference 
to a bit mask designation des- 
ignated by the command 
should be made. 
These designate a condition 
for compare of characters or 
data stored in the content 
addressable memory 500 and 
text data in units of one byte. 
Each subfield designates one 
of the four following compare 
conditions by use of two bits. 
A matching is determined in 
the case where the stored 
character or data and the text 
data are equal to each other. 
A matching is determined in 
the case where they are dif- 
ferent from each other. 
A matching is always deter- 
mined. 

A mismatching is always 
determined. 



[0049] Fig. 9 shows an embodiment of the content 
addressable memory 500. The content addressable 
memory 500 makes a compare of characters or data 
stored therein and text data while referring to the com- 
pare control memory 600. 

[0050] In Fig. 9, each of the content addressable 
memory 500 and the compare control memory 600 is 
shown to have a construction of one word, and data 
lines, memory cells and comparators are shown in units 
of eight bits. The overall content addressable memory 
and compare control memory includes the repetition of 
the shown construction. The following explanation will 
be made of the shown one-word construction and its 
operation. 

[0051] In a data set mode, the write of data into mem- 
ory cells (memO) 510, (mem1) 520, (mem2) 530 and 
(mem3) 540 of the content addressable memory and 
memory cells (B0) 610, (B1 ) 620, (B2) 630 and (B3) 640 
of the compare control memory is possible. The con- 
nection of data paths between the data lines and the 
memory cells is established by the control section 800 
in accordance with the content of the function definition 
register (FDR) 820. A character string or a partial char- 
acter string is set into the content addressable memory 
500, and compare conditions are set into the compare 
control memory 600. In a search mode, the connection 
of data paths between the data lines and comparators 
(compO) 511, (compl) 521, (comp2) 531 and (comp3) 
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541 is established. The text data on the data lines and 
the character string or partial character string stored in 
the memory cells (memO - 3) 51 0 to 540 of the content 
addressable memory 500 are inputted to the compara- 
tors (compO - 3) 51 1 to 541 to make a compare for each 
bit. The results of comparing in units of one bit are 
ANDed by logical AND gates 512, 522, 532 and 542 in 
units of one byte to produce the results 515, 525, 535 
and 545 of comparing in units of one byte. For these 
results, compare logics 51 3, 523, 533 and 543 liberate 
the compare conditions stored in the memory cells (B0 
- B3) 610 to 640 of the compare control memory. A 
detailed embodiment of the compare logic is shown in 
Fig. 9. Inputs of the compare logic include the result of 
comparing for one byte and a compare condition repre- 
sented by two bits (indicated by M and L bits in Fig. 9). 
For these inputs, the results 516, 526, 536 and 546 of 
comparing in units of one byte taking the compare con- 
ditions into consideration are outputted in accordance 
with a truth table shown in Fig. 10. Finally, a logical AND 
of the results of comparing for the four bytes is produced 
by an AND gate 550. Thereby, the search-out of the 
character string or partial character string set in the con- 
tent addressable memory is outputted as a matching 
signal 560. The AND gate 550 has an input for a control 
signal 650 which invalidates the matching signal uncon- 
ditionally. 

[0052] Fig. 1 1 shows an embodiment of the compare 
result analyzing circuit 700. The compare result analyz- 
ing circuit 700 receives a matching signal from the con- 
tent addressable memory 500 and outputs signals XX, 
YY and ZZ for examining the presence/absence of hit in 
the matching signal and examining in the case of the 
presence of hit whether or not the number of hits is plu- 
ral. The matching signal from the content addressable 
memory 500 is passed through the priority encoder 
(PE) 430. The construction of the present embodiment 
has, as a fundamental constituent unit, blocks (MHD0) 
710, (MHD1) 720, (MHD2) 730 and (MHD3) 740 for 
making the analysis of plural hits for eight-bit matching 
signals 701 , 702, 703 and 704. The compare result ana- 
lyzing circuit 700 includes the MHD0 - 3 and logical cir- 
cuits for producing the outputs XX, YY and ZZ from 
outputs X0 - 3, Y0 - 3 and Z0 - 3 of the MHD blocks. 
[0053] The detailed construction of the MHD is shown 
in Fig. 12. MHDn (n = 0 - 3) receives the matching sig- 
nals 701 , 702, 703 and 704 for eight words and outputs 
Xn which indicates the presence/absence of hit in eight 
bits, Yn which indicates either plural hits or the other, 
and Zn which indicates either a single hit or the other. 
XX is produced from X0 - 3, YY from Y0 - 3, and ZZ from 
Z0 - 3. The XX indicates the presence/absence of hit 
over all the matching signals, the YY indicates the pres- 
ence/absence of an MHD block including plural hits, and 
the ZZ indicates whether or not an MHD block including 
a single hit is plural in number. In this manner, it 
becomes possible to obtain a logic YY + ZZ which indi- 
cates the presence of plural hits and a logic XX • YY • 
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ZZ which indicates a single hit or the presence of only 
one hit. The information necessary for a sequence con- 
trol is sent to the control section 800. 
[0054] The present embodiment can also be realized 
with the other number of MHD blocks and the other 5 
number of input signals of the MHD block. 
[0055] Fig. 1 3 shows an embodiment of the format of 
the data output register (DOR) 340. Depending on the 
value of an OUTO bit of a character string search com- 
mand set in the function definition register 820, the for- 
mat is classified into RAM OUT in which the content of 
the RAM 440 is outputted as it is and CODE MIX in 
which the content of a lower byte of the RAM and inter- 
nal information are outputted together. 
[0056] In the case of RAM OUT, the content of the 
non-hit register is held in the data output register if there 
is no hit. If there is any hit, the content of the RAM is 
held in the data output register. Accordingly, there is no 
special format. 

[0057] In the case of CODE MIX, the content of the 
non-hit register is held in the data output register. If 
there any hit, an upper byte of the data output register is 
substituted by an internal code. The upper byte is com- 
posed of the following sub-fields. However, when a bit 
pattern search is designated, AH and NOA subfields do 
not exist. 

MH: This indicates either plural hits or a single hit. 

SW: This indicates that the write of data, which is 
not used for a search processing, was made. 

IF: This monitors the set/reset condition of an 
interruption request signal. 

AH: This indicates hit in an approximate character 
string search sequence. 

NOA: This indicates the id number of the approxi- 
mate character string search sequence. 

RS: This indicates the number of times of remain- 
ing data shift in the data input registers. 

[0058] According to the present embodiment, not only 
the read of the content of the RAM but also the read of 
internal information of the search module required by 
the CPU for a control of a search processing become 
possible by an address of the content addressable 
memory having the content matching with text data. 
[0059] Fig. 14 shows an embodiment of the condition 
code register (CCR) 810 which is the condition holding 
means. The condition code register 810 is constructed 
with four bits and is composed of an hi bit, an hO bit, an 
s bit and a p_ bit. The hi and hO bits indicate the result of 
comparision, the s bit indicates the presence/absence 
of remaining text data shift in the data input registers, 
and the q bit indicates whether or not data written in the 
search module 200 immediately before is to be used for 
a search processing. The details thereof are as follows, 
depending on the content of the function definition reg- 
ister 820. 

[0060] In the case of NOP (NON-OPERATION), (hi, 
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hO) = 00 (fixed), and s and q are undefinite. 
[0061 ] In the case of CRST (CONTENT ADDRESSA- 
BLE MEMORY RESET), (hi, hO) = 11 (fixed), and s and 
p are undefinite. 

[0062] In the case of CACS (COMPARE CONTROL 
MEMORY ACCESS), (hi, hO) = 01 (fixed), and s and p_ 
are undefinite. 

[0063] In the case of COMPFn (n = 0 - 3), that is, a bit 
pattern search or a character string search, 



(h1,h0) = 00: 


presence of single hit, 


01 : 


absence of hit, 


10 : 


presence of plural hits, 


11 : 


during execution of search 




sequencey, 


s = 0: 


absence of remaining shift, 


1 : 


presence of remaining shift, 


p = 0: 


text data written immediately before 




is to be used for search, 


1 : 


text data written immediately before 




is not to be used for search. 



[0064] According to the present embodiment, the read 
of internal information of the search module required by 
the CPU for a control of a search processing becomes 
possible from the content of the condition code register. 
Though this internal information can also be known from 
the content of the data output register, the provision of 
the control line bus 230 for direct connection of the con- 
dition code register and the CPU makes it possible to 
omit register access. Accordingly, there is obtained an 
effect that a higher-speed search processing can be 
performed. 

[0065] Fig. 15 shows an embodiment of input and out- 
put signals of the search module 200. The construction 
shown in Fig. 1 5 can be realized using either the posi- 
tive or negative polarity of each input/output signal. A 
clock signal, a power source or ground may be plural in 
number. An address bus or a data bus can also be real- 
ized with the number of bits other than that shown. 
[0066] Fig. 16 is a block diagram of a search proces- 
sor (or a single-chip microcomputer) 10 which is inte- 
grated on one chip with the search module of the 
present invention being incorporated. 
[0067] The search processor 10 includes a CPU 100, 
a RAM 1 1 0, a ROM 1 20 and a search module 200 of the 
present invention. These components are connected 
through an address bus 130 and a data bus 140. Also, 
the search module 200 and the CPU 100 are connected 
through a control bus 150 including a condition code, a 
wait signal and an interruption request signal. A com- 
munication with the exterior is made through an address 
buffer 160 and a data buffer 170. By using the search 
module of the present invention, there is obtained an 
effect that the speed of a character string search 
processing performed on a software in the prior art can 
be improved. The present embodiment can also be real- 
ized as a search processor with another construction so 
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long as the CPU and the search module are included. 
[0068] Fig. 17 shows an embodiment of a magnetic 
disk system 30 in which the search processor 10 of the 
present invention is incorporated as a controller. The 
present embodiment includes a disk unit 16 and a disk 5 
controller section 20. The disk controller section 20 is 
composed of an SCSI interface 13 for making a commu- 
nication of data and commands with a host CPU, a 
buffer 14 for temporarily storing data, the search proc- 
essor 10 used as a controller, a RAM 1 1 used as a work 10 
area, and a ROM 12 for storing a program and data of 
the search processor 10. The interface for the host CPU 
can also be realized with an interface other than the 
SCSI interface. 

[0069] The present embodiment not only serves as 15 
the conventional magnetic disk system with a controller 
incorporated but also has a function of searching for 
data stored in the disk owing to the use of the search 
processor 10. Namely, the system is provided with a 
normal mode in which the system is used as a usual 20 
magnetic disk device by means of the conventional 
SCSI interface and a search mode in which a search 
function based on a string search is performed. A bit 
pattern, to which an SCSI command is not allotted, is 
used as a mode change-over command to make a 25 
change-over between both the modes. The designation 
for a search function is made after a search mode has 
been established by the mode change-over command. 
In the search mode, when a search processing is desig- 
nated, the search processor 10 compares data of the 30 
buffer read from the disk and a character string (s) to be 
searched out. After the search processing has been 
completed, the search processor 10 informs the host 
CPU of whether or not there is a character string(s) 
matched. This can be realized, for example, in such a 35 
manner that the magnetic disk system 30 makes an 
interruption to the host CPU or in such a manner that 
the host CPU reads the result of search held by the 
magnetic disk system 30. After having brought the mag- 
netic disk system into a normal mode, the host CPU 40 
read only necessary data in accordance with the result 
of search. 

[0070] The interface between the magnetic disk sys- 
tem 30 and the host CPU may take a construction in 
which an extended interface is added to the SCSI inter- 45 
face. If a distinction between the normal mode and the 
search mode is made by the extended interface portion, 
the transfer of a command for mode change-over 
becomes unnecessary. Thereby, a further speed-up is 
attainable. so 
[0071] Figs. 18A and 18B comparatively show the 
conventional procedure and the inventive procedure for 
character string search from data in a disk, respectively. 
In the conventional procedure, data is transferred from a 
disk in accordance with a data transfer request from a 55 
host CPU and the host CPU performs a search process- 
ing for the data to select necessary data therefrom. In 
this case, if the amount of data to be searched is large, 



the data transfer is made plural times. In other words, 
there is a problem that the transfer amount of useless 
data becomes very large. On the other hand, in the 
present invention, a search processing is performed in 
the magnetic disk system. Therefore, it is not necessary 
to make useless data transfer between the disk and the 
host CPU which was made in the conventional system. 
Accordingly, data transfer at a speed higher than that in 
the conventional system becomes possible. This brings 
about an effect of improving the efficiency in utilization 
of the SCSI bus. Also, the present embodiment makes it 
possible to provide a novel function of content search 
which was not offered by the conventional magnetic disk 
system. 

[0072] The construction of the present embodiment 
using a search processor as a controller is also applica- 
ble to a semiconductor disk system such as a RAM disk 
system or an optical disk system which uses a CD-ROM 
or the like. 

[0073] Fig. 19 shows an embodiment of a search sys- 
tem provided with a host CPU system 40 having an 
interface for the magnetic disk system shown in Fig. 17. 
To an internal bus are connected a CPU 41 , a RAM 42, 
a ROM 43, interface means 44 for a network, and inter- 
face means 45 for a secondary storage system such as 
a magnetic disk system. The interface means 45 for the 
secondary storage system is also connected to an SCSI 
bus 46 and an extended bus 47 which are in turn con- 
nected to the secondary storage system through an 
SCSI interface port 48 and an extended interface port 
49, respectively. When a command is transferred to the 
magnetic disk system 30 (as shown in Fig. 17) having a 
search function, the extended bus 47 is used to inform 
the system 30 that an extended command is concerned. 
Accordingly, the magnetic disk system 30 and the host 
CPU system 40 make, the designation of a character 
string search and a control for transfer of a necessary 
file based on the result of search, through the SCSI bus 
and the extended bus. 

[0074] By thus using the SCSI bus and the extended 
bus, it becomes possible for the host CPU system 40 to 
possess an ordinary generalized interface as well as an 
interface for a magnetic disk system with an extended 
interface having a character string search function. In 
addition, if only the generalized interface portion is 
used, the connection to a magnetic disk 1 7 with the con- 
ventional SCSI interface can be made easily. 
[0075] A similar construction can also be realized 
using a magnetic disk system in which a search 
processing is performed through a change-over 
between a normal mode and a search mode by a mode 
change-over command. Further, if there is not a mag- 
netic disk system having an extended interface, the 
extended bus 47 and the extended interface 49 can be 
omitted from the host CPU system 40, thereby making 
the reduction in hardware scale possible. 
[0076] According to the present invention, there are 
obtained the following effects. 
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[0077] A CPU and a search module are connected by 
not a special data bus but a generalized bus. Therefore, 
seeing from the CPU, the search module looks as it is 
address-mapped. Accordingly, the execution of a 
search processing and the acquisition of the result can 5 
be made easily in a procedure similar to the procedure 
of data transfer for a memory. Also, the search module 
can be mapped at any address. Therefore, in the case 
where the configuration of a single-chip microcomputer 
is employed, good matchability with another module is 10 
obtained. Further, since the search module performs 
only a leading collation processing, it can be realized 
with a small-scale hardware. Therefore, the CPU and 
the search module can be integrated as a search proc- 
essor on the same chip, thereby improving the speed of 15 
data transfer between the CPU and the search module. 
As apparent from the foregoing, there is an effect that a 
character string search processing by a single-chip 
microcomputer can be performed at a much higher 
speed as compared with that by the conventional appa- 20 
ratus. Furthermore, an applied system can be struc- 
tured with a hardware scale smaller than that in the prior 
art. Accordingly, there is also obtained an effect that a 
content search function, which was not provided in the 
prior art, can be realized by applying the search proces- 25 
sor to a secondary storage system. 

Claims 

1. A string search apparatus (200) operable in combi- 30 
nation with a CPU (100), said apparatus being con- 
nected to the CPU (100) via an address bus and a 
data bus and comprising: 

function definition means (203) for defining a 35 
function of the apparatus in accordance with a 
command from said CPU; 
data input/output means (202) for receiving 
data via said data bus and outputting the result 
of a data processing; 40 
search processing means (201) for performing 
the data processing based on a function 
defined by said function definition means by 
use of data inputted to said data input/output 
means; and 45 
condition holding means (204) for holding data 
indicative of an internal condition correspond- 
ing to the result of the data processing, 
wherein a search process to be performed by 
the combination of the string search apparatus so 
(200) and the CPU (100) includes a search 
process, performed by the string search appa- 
ratus, for a compare start portion of the data. 

2. The apparatus according to claim 1/ which is 55 
formed on a single semiconductor chip. 

3. The apparatus according to claim 1, wherein at 



least said CPU (100), said apparatus (200), said 
address bus and said data bus are formed on a sin- 
gle semiconductor chip to provide a microcomputer. 

The apparatus according to claim 3, further com- 
prising a RAM (110) and a ROM (120) which are 
also formed on said single semiconductor chip to 
provide said microcomputer (10). 

The apparatus according to any of claims 1 to 4 
being suitable for making a decision of whether or 
not a symbol string to be searched out exists in a 
symbol string to be searched which is composed of 
symbols represented by codes, 
wherein said search processing means (201) is 
capable to perform a search processing based on a 
function defined by said function definition means 
(203) by use of the symbol string to be searched out 
which is stored in said search processing means 
and the symbol string to be searched which is 
inputted to said data input/output means (202). 

The apparatus according to claim 5, wherein said 
search processing means includes 

masking means (410) for masking data in said 
input means (300) in units of one bit; 
data path interchange means (420) for making 
a positional interchange of the data in said 
input means in units of a byte; 
a content addressable memory (500) for stor- 
ing beforehand a symbol string to be searched 
out and comparing the stored symbol string to 
be searched out and the symbol string to be 
searched which has been subjected to the 
positional interchange by said data path inter- 
change means; 

a compare condition memory (600) for desig- 
nating a condition for compare in said content 
addressable memory, that is, what relationship 
between the symbol string to be searched and 
the symbol string to be searched out is deter- 
mined as being a matching; 
compare result analysing means (700) for ana- 
lysing the number of words matched from the 
result of compare in said content addressable 
memory; 

matching signal separating means (430) for 
outputting matching signals of the words 
matched one by one; 

an output data memory (440) the access 
address of which is designated by said match- 
ing signal; and 

non-hit data holding means (450) for holding 
data which is to be outputted to said search 
result output means in the case where said out- 
put data memory is not accessed; 
wherein said input/output means (300) 
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includes search result output means (340) for 
outputting the content of said output data mem- 
ory through said data bus; and 
wherein said condition holding means (810) 
holds an internal condition of said symbol 5 
string search apparatus. 

7. The apparatus according to claim 6, wherein said 
input means (300) includes a first group of input 
means (310, 320) having a data path to said con- 10 
tent addressable memory (500) and a second 
group of input means (330, 340) having no data 
path to said content addressable memory, and said 
input means successively shifts inputted data from 
said second group of input means to said first group 15 
of input means while being passed through each 
input means so that the symbol string to be 
searched which is stored in said first group of input 
means is transferred to said content addressable 
memory. 20 

8. The apparatus according to claim 7, wherein the 
amount of shift of the symbol string to be searched 
in said input means groups (310 - 340) is one 
selected from one bit, one byte and two bytes. 25 

9. The apparatus according to claim 7 or 8, wherein 
the amount of shift of the symbol string to be 
searched in said input means groups (310-340) is 
designated by a command set into said function 30 
definition means (820). 

10. The apparatus according to claim 7, wherein when 
the symbol string to be searched which is stored in 
said first group of input means (310, 320) is trans- 35 
ferred to said content addressable memory (500), 

the arrangement of data is changed in units of one 
byte. 

11. The apparatus according to claim 10, wherein the 40 
manner of the change of the arrangement of the 
data of the symbol string to be searched which is 
stored in said first group of input means (310, 320), 

is designated by a command set into said function 
definition means (820). 45 

12. The apparatus according to any of claims 6 to 11, 
wherein when the symbol string to be searched is 
written into said input means (310 - 340), a search 
processing sequence is activated in accordance so 
with the setting by said function definition means 
(820) and either data stored in said output data 
memory (440) or data stored in said non-hit data 
holding means (450) is outputted to said search 
result output means (340). 55 

13. The apparatus according to claim 12, wherein when 
the symbol string to be searched is written, input 



means for activating the search processing 
sequence set in said function definition means 
(820) is designated by a command set into said 
function definition means. 

14. The apparatus according to any of claims 6 to 13, 
wherein the condition of comparison between the 
symbol string to be searched and the symbol string 
to be searched out is set in units of one word for the 
symbol string to be searched which is stored in said 
content addressable memory (500), and wherein 
said condition of comparing includes a condition 
under which the matching is determined in the case 
where the symbol string to be searched and the 
symbol string to be searched out are equal to each 
other, a condition under which the matching is 
determined in the case where they are not equal to 
each other, a condition under which the matching is 
always determined irrespective of a relationship 
between them, and a condition under which a mis- 
matching is always determined irrespective of a 
relationship between them. 

15. The apparatus according to any of claims 6 to 14, 
further comprising an output terminal or interface 
means for transferring the content of said condition 
holding means (810) to said CPU (100) directly. 

16. The apparatus according to any of claims 6 to 15, 
wherein in the case where the matching between 
the symbol string to be searched and the symbol 
string to be searched out is detected, an interrup- 
tion request signal is outputted. 

17. The apparatus according to claim 16, wherein 
whether or not the interruption request signal 
should be outputted in the case where the matching 
between the symbol string to be searched and the 
symbol string to be searched out is detected, is 
designated by a command set into said function 
definition means (820). 

18. The apparatus according to claim 16, wherein when 
the content of said output data memory (440) is out- 
putted to said search result output means (340) in 
accordance with a signal indicative of the matching, 
whether or not a part of said content should be sub- 
stituted by internal information of the symbol string 
search apparatus is designated by a command set 
into said function definition means (820). 

19. The apparatus according to any of claims 6 to 18, 
wherein a mask is set to the same bit position in 
units of one byte or two bytes for each symbol string 
to be searched out which is stored in said content 
addressable memory (500). 

20. The apparatus according to claim 19, wherein the 
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same bit position to which the mask is set in units of 
one byte or two bytes for each symbol string to be 
searched out which is stored in said content 
addressable memory (500), is designated by a 
command set into said function definition means 5 
(820). 

21 . The apparatus according to any of claims 6 to 20, 
wherein the contents of said content addressable 
memory (500), said compare condition memory 10 
(600) and said output data memory (440) are 
accessed from said CPU (100). 

22. The apparatus according to claim 1, wherein said 
function definition means (203) includes a first reg- 15 
ister (FDR) in which said command is set; said data 
input/output means includes a second register 
(DIR2) in which the data from said CPU is set via 
said address bus, and a third register (DOR) in 
which the result of the data processing is set; and 20 

said first, second and third registers are 
address-mapped in an address space. 

23. The apparatus according to claims 1 to 22, wherein 25 
commands to be supplied to said function definition 
means (203) from said CPU (100) includes a com- 
mand directing search for one-byte data and a com- 
mand directing search for two-byte data. 

30 

Patentanspruche 

1. Zeichenfolgen-Sucheinrichtung (200) zum Betrieb 
in Verbindung mit einer CPU (100), wobei die Ein- 
richtung uber einen Adressenbus und einen Daten- 35 
bus mit der CPU (100) verbunden ist und aufweist: 

eine Funktionsdefiniereinrichtung (203) zum 
Definieren einer Funktion der Einrichtung ent- 
sprechend einem Befehl von der CPU, 40 
eine Daten-Ein/Ausgabeeinrichtung (202) zum 
Empfangen von Daten uber den Datenbus und 
zum Ausgeben des Ergebnisses einer Daten- 
verarbeitung, 

eine Suchverarbeitungseinrichtung (201) zur as 
Durchfuhrung der Datenverarbeitung aufgrund 
einer von der Funktionsdefiniereinrichtung defi- 
nierten Funktion unter Verwendung von an der 
Daten-Ein/Ausgabeeinrichtung eingegebenen 
Daten, und 50 
eine Bedingungshalteeinrichtung (204) zur 
Aufnahme von Daten, die eine dem Ergebnis 
der Datenverarbeitung entsprechende interne 
Bedingung angeben, 

wobei ein von der Kombination aus der Zei- 55 
chenfolgen-Sucheinrichtung (200) und der 
CPU (100) durchzufuhrender Suchvorgang 
einen von der Zeichenfolgen-Sucheinrichtung 



auszufuhrenden Suchvorgang fur einen Ver- 
gleichs-Anfangsteil der Daten einschlieBt 

2. Einrichtung nach Anspruch 1 , die auf einem einzel- 
nen Halbleiterchip ausgebildet ist. 

3. Einrichtung nach Anspruch 1, wobei mindestens 
die CPU (100), die Einrichtung (200), der Adres- 
senbus und der Datenbus unter Bildung eines 
Mikrocomputers auf einem einzelnen Halbleiterchip 
ausgebildet sind. 

4. Einrichtung nach Anspruch 3 mit ferner einem RAM 
(110) und einem ROM (120), die unter Bildung des 
Mikrocomputers (10) ebenfalls auf dem einzelnen 
Halbleiterchip ausgebildet sind. 

5. Einrichtung nach einem der Anspruche 1 bis 4, die 
sich dazu eignet zu entscheiden, ob in einer zu 
durchsuchenden Zeichenfblge, die aus durch Code 
dargestellten Symbolen besteht, eine zu suchende 
Symbolfolge existiert oder nicht, 

wobei die Suchverarbeitungseinrichtung (201) in 
der Lage ist, aufgrund einer von der Funktionsdefi- 
niereinrichtung (203) definierten Funktion eine 
Suchverarbeitung unter Verwendung der in der 
Suchverarbeitungseinrichtung gespeicherten, zu 
suchenden Symbolfolge und der an der Daten- 
Ein/Ausgabeeinrichtung (202) eingegebenen zu 
durchsuchenden Symbolfolge durchzufuhren. 

6. Einrichtung nach Anspruch 5, wobei die Suchverar- 
beitungseinrichtung aufweist: 

eine Maskiereinrichtung (410) zum Maskieren 
von Daten in der Eingabeeinrichtung (300) in 
Ein-Bit-Einheiten, 

eine Datenbahn-Wechseleinrichtung (420) zur 
Durchfuhrung eines Positionswechsels der 
Daten in der Eingabeeinrichtung in Byte-Ein- 
heiten, 

einen nach dem Inhalt adressierbaren Spei- 
cher (500) zum vorherigen Speichern einer zu 
suchenden Symbolfolge und zum Vergleichen 
der gespeicherten, zu suchenden Symbolfolge 
mit der zu durchsuchenden Symbolfolge, die 
von der Datenbahn-Wechseleinrichtung dem 
Positionswechsel unterzogen wurde, 
einen Vergleichsbedingungsspeicher (600) 
zum Bezeichnen einer Bedingung fur den Ver- 
gleich in dem nach dem Inhalt adressierbaren 
Speicher, d.h. zum Bezeichnen der als Uber- 
einstimmung bestimmten Beziehung zwischen 
derzu durchsuchenden und derzu suchenden 
Zeichenfolge, 

eine Vergleichsergebnis-Analysiereinrichtung 
(700) zum Analysieren der Anzahl von uberein- 
stimmenden WOrtern aus dem Vergleichser- 
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gebnis in dem nach dem Inhalt adressierbaren 
Speicher, 

eine Ubereinstimmungssignal-Separiereinrich- 
tung (430) zum Ausgeben von Ubereinstirrv- 
mungssignalen der eins zu eins 5 
ubereinstimmenden W6rter, 
einen Ausgangsdatenspeicher (440), dessen 
Zug riff sad resse von dem Ubereinstimmungssi- 
gnal bezeichnet wird, und 

eine Fehltreffer-Datenaufnahmeeinrichtung 10 
(450) zum Aufnehmen von Daten, die nicht an 
die Suchergebnis-Ausgabeeinrichtung auszu- 
geben sind, falls nicht auf den Ausgangsdaten- 
speicher zugegriffen wird, 

wobei die Ein/Ausgabeeinrichtung (300) eine 15 
Suchergebnis-Ausgabeeinrichtung (340) zum 
Ausgeben des Inhalts des Ausgangsdatenspei- 
chers uber den Datenbus aufweist, und 
wobei die Bedingungshalteeinhchtung (810) 
eine interne Bedingung der Symbolfolgen- 20 
Sucheinrichtung aufnimmt. 

7. Einrichtung nach Anspruch 6, wobei die Eingabe- 
einrichtung (300) eine erste Gruppe von Eingabe- 
einrichtungen (310, 320) mit einem Datenpfad zu 25 
dem nach dem Inhalt adressierbaren Speicher 
(500) sowie eine zweite Gruppe von Eingabeein- 
richtungen (330, 340) aufweist, die keinen Daten- 
pfad zu dem nach dem Inhalt adressierbaren 
Speicher haben, und wobei die Eingabeeinrichtung 30 
eingegebene Daten von der zweiten Gruppe von 
Eingabeeinrichtungen unter Durchlaufen jeder Ein- 
gabeeinrichtung nacheinander zu der zweiten 
Gruppe von Eingabeeinrichtungen verschiebt, so 
dafc die in der ersten Gruppe von Eingabeeinrich- 35 
tungen gespeicherte, zu durchsuchende Symbol- 
folge in den nach dem Inhalt adressierbaren 
Speicher uberfuhrt wird. 

8. Einrichtung nach Anspruch 7, wobei das Maft der 40 
Verschiebung der zu durchsuchenden Symbolfolge 

in den Gruppen von Eingabeeinrichtungen (310 - 
340) ein Bit, ein Byte Oder zwei Bytes betragt. 

9. Einrichtung nach Anspruch 7 oder 8, wobei das 45 
Mali der Verschiebung der durchsuchenden Sym- 
bolfolge in den Gruppen von Eingabeeinrichtungen 
(310 - 340) von einem in die Funktionsdefinierein- 
richtung (820) eingegebenen Befehl bestimmt wird. 

50 

10. Einrichtung nach Anspruch 7, wobei die zu durch- 
suchende Symbolfolge, die in der ersten Gruppe 
von Eingabeeinrichtungen (310, 320) gespeichert 
ist, in den nach dem Inhalt adressierbaren Speicher 
(500) uberfuhrt wird, und wobei die Anordnung der 55 
Daten in Ein-Byte-Einheiten geSndert wird. 

11. Einrichtung nach Anspruch 10, wobei die Art der 



Anderung in der Anordnung der Daten der zu 
durchsuchenden Symbolfolge, die in der ersten 
Gruppe von Eingabeeinrichtungen (310, 320) 
gespeichert ist, von einem in die Funktionsdefinier- 
einrichtung (820) eingegebenen Befehl bestimmt 
wird. 

12. Einrichtung nach einem der Anspruche 6 bis 11, 
wobei die zu durchsuchende Symbolfolge in die 
Eingabeeinrichtungen (310 - 340) geschrieben 
wird, eine Suchverarbeitungssequenz entspre- 
chend der Einstellung der Funktionsdefiniereinrich- 
tung (820) aktiviert wird und entweder in dem 
Ausgangsdatenspeicher (440) oder in der Fehltref- 
fer-Datenaufnahmeeinrichtung (450) gespeicherte 
Daten an die Suchergebnis-Ausgabeeinrichtung 
(349) ausgegeben werden. 

13. Einrichtung nach Anspruch 12, wobei beim Ein- 
schreiben der zu durchsuchenden Symbolfolge die 
Eingabeeinrichtung zum Aktivieren der in die Funk- 
tionsdefiniereinrichtung (820) eingegebenen Such- 
verarbeitungssequenz von einem in die 
Funktionsdefiniereinrichtung eingegebenen Befehl 
bestimmt wird. 

14. Einrichtung nach einem der Anspruche 6 bis 13, 
wobei die Bedingung fur den Vergleich zwischen 
der zu durchsuchenden Symbolfolge und der zu 
suchenden Symbolfolge in Ein-Wort-Einheiten fur 
die zu durchsuchende Symbolfolge, die in dem 
nach dem Inhalt adressierbaren Speicher (500) 
gespeichert ist, eingestellt wird, und wobei zu der 
Vergleichsbedingung eine Bedingung gehort, bei 
der Ubereinstimmung festgestellt wird, falls die zu 
durchsuchende Symbolfolge und die zu suchende 
Symbolfolge gleich sind, eine Bedingung, bei der 
Ubereinstimmung festgestellt wird, wenn die beiden 
Folgen nicht gleich sind, eine Bedingung, bei der 
Ubereinstimmung unabhangig von der Beziehung 
zwischen den Folgen immer festgestellt wird, sowie 
eine Bedingung, bei der unabhangig von der Bezie- 
hung zwischen den Folgen immer eine Fehluber- 
einstimmung festgestellt wird. 

15. Einrichtung nach einem der Anspruche 6 bis 14 mit 
ferner einem AusgangsanschluB oder einer Inter- 
faceeinrichtung zum direkten Uberfuhren des 
Inhalts der Bedingungshalteeinrichtung (810) an 
die CPU (100). 

16. Einrichtung nach einem der Anspruche 6 bis 15, 
wobei dann, wenn zwischen der zu durchsuchen- 
den und der zu suchenden Symbolfolge Oberein- 
stimmung festgestellt wird, ein Unterbrechungs- 
Anforderungssignal ausgegeben wird. 

17. Einrichtung nach Anspruch 16, wobei die Frage, ob 
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bei Obereinstimmung zwischen der zu durchsu- R< 
chenden und der zu suchenden Symbolfolge ein 
Unterbrechungs-Anforderungssigual auszugeben 1. 
ist Oder nicht, von einem in die Funktionsdefinier- 
einrichtung (820) eingegebenen Befehl bestimmt 5 
wird. 

18. Einrichtung nach Anspruch 16, wobei der Inhalt des 
Ausgangsdatenspeichers (440) entsprechend 
einem die Obereinstimmung angebenden Signal an 10 
die Suchergebnis-Ausgabeeinrichtung (340) aus- 
gegeben wird, und wobei die Frage, ob ein Teil die- 
ses Inhalts durch interne Infbrmationen der 
Symbolfblgen-Sucheinrichtung zu ersetzen ist Oder 
nicht, von einem in die Funktionsdefiniereinrichtung 15 
(820) eingegebenen Befehl bestimmt wird. 

19. Einrichtung nach einem der Anspruche 6 bis 18, 
wobei fur jede zu suchende Symbolfolge, die in 
dem nach dem Inhalt adressierbaren Speicher 20 
(500) gespeichert ist, eine Maske auf die gleiche 
Bitstelle in Einheiten von einem oder zwei Bytes 
gelegt wird. 

20. Einrichtung nach Anspruch 19, wobei die gleiche 25 
Bitposition, auf die die Maske in Einheiten von 
einem oder zwei Bytes fur jede zu suchende Sym- 
bolfolge, die in dem nach dem Inhalt adressierba- 2. 
ren Speicher (500) gespeichert ist, gelegt wird, von 
einem in die Funktionsdefiniereinrichtung (820) ein- 30 
gegebenen Befehl bestimmt wird. 3. 

21. Einrichtung nach einem der Anspruche 6 bis 20, 
wobei die CPU (100) auf den Inhalt des nach dem 
Inhalt adressierbaren Speichers (500), des Ver- 35 
gleichsbedingungsspeichers (600) und des Aus- 
gangsdatenspeichers (440) Zugriff hat. 4. 

22. Einrichtung nach Anspruch 1, wobei die Funktions- 
definiereinrichtung (203) ein erstes Register (FDR) 40 
aufweist, in das der Befehl eingegeben wird, wobei 

die Daten-Ein/Ausgabeeinrichtung ein zweites 
Register (DIR2), in das Daten von der CPU uber 5. 
den Adressenbus eingegeben werden, sowie ein 
drittes Register (DOR) aufweist, in das das Ergeb- 45 
nis der Datenverarbeitung eingegeben wird, und 
wobei fur das erste, das zweite und das dritter 
Register in einem Adressenraum ein Adressenplan 
vorliegt. 

50 

23. Einrichtung nach einem der Anspruche 1 bis 20, 
wobei zu den von der CPU (100) der Funktionsdefi- 
niereinrichtung (203) zugefuhrten Befehlen ein 
Befehl, der eine Suche nach Ein-Byte-Daten 
anweist, sowie ein Befehl, der eine Suche nach 55 
Zwei-Byte-Daten anweist, gehoren. 



Dispositif de recherche de chame (200) exploitable 
en combinaison avec une unite centrale de traite- 
ment (CPU) (100), ledit dispositif etant connecte a 
la CPU (100) via un bus d'adresses et un bus de 
donnees, et comportant : 

des moyens de definition de fbnction (203) 
pour definir une fonction du dispositif confor- 
mement a une instruction provenant de ladite 
CPU, 

des moyens d'entree/sortie de donnees (202) 
pour recevoir des donnees via ledit bus de don- 
nees et emettre le resultat d'un traitement de 
donnees, 

des moyens de traitement de donnees (201) 
pour effectuer le traitement de donnees sur la 
base d'une fbnction definie par lesdits moyens 
de definition de fonction en utilisant des don- 
nees entrees dans lesdits moyens 
d'entree/sortie de donnees, 
et des moyens de maintien de condition (204) 
pour maintenir des donnees representatives 
d'une condition interne correspondant au resul- 
tat du traitement de donnees. 

Dispositif selon la revendication 1 , qui est forme sur 
une puce unique a semi-conducteurs. 

Dispositif selon la revendication 1 , dans lequel au 
moins ladite CPU (100), ledit dispositif (200), et 
ledit bus d'adresses et ledit bus de donnees sont 
formes sur une puce unique a semi-conducteurs 
pour realiser un micro-ordinateur. 

Dispositif selon la revendication 3, comportant de 
plus une memoire RAM (110) et une memoire ROM 
(120) qui sont egalement formees sur ladite puce 
unique a semi-conducteurs pour realiser ledit 
micro-ordinateur (10). 

Dispositif selon Tune quelconque des revendica- 
tions 1 a 4, qui est adapte pour prendre une deci- 
sion concernant si une chame de symboles a 
trouver existe ou non dans une chame de symboles 
a etudier qui est constitute de symboles represen- 
ts par des codes, 

dans lequel lesdits moyens de traitement de recher- 
che (201) sont capables d'effectuer un traitement 
de recherche sur la base d'une fonction definie par 
lesdits moyens de definition de fonction (203) en 
utilisant la chame de symboles a trouver qui est 
memorisee dans lesdits moyens de traitement de 
recherche et la chatne de symboles a etudier qui 
est entree dans lesdits moyens d'entree/sortie de 
donnees (202). 
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Dispositif selon la revendication 5, dans lequel les- 
dits moyens de traitement de recherche comportent 

des moyens de masquage (410) pour masquer 
des donnees dans lesdits moyens d'entree 5 
(300) en unites de un octet, 
des moyens de permutation de trajets de don- 
nees (420) pour effectuer une permutation de 
positions des donnees dans lesdits moyens 
d'entree en unites d'un octet, 10 
une memoire a contenu adressable (500) pour 
memoriser a I'avance une chaTne de symboles 
a trouver et comparer la chaTne de symboles 
memorisee a trouver et la chaTne de symboles 
a etudier qui a ete soumise a la permutation de 15 
positions par lesdits moyens de permutation de 
trajets de donnees, 

une memoire de condition de comparaison 
(600) pour designer une condition pour compa- 
rer ladite memoire a contenu adressable, c'est- 20 
a-dire quelle relation entre la chaTne de symbo- 
les a etudier et la chaTne de symboles a trouver 
est determinee comme etant une correspon- 
dence, 

des moyens d'analyse de resultat de comparai- 25 
son (700) pour analyser le nombre de mots qui 
correspondent a partir du resultat de comparai- 
son de ladite memoire a contenu adressable, 
des moyens de separation de correspondance 
(430) pour emettre des signaux de correspon- 30 
dance des mots qui correspondent un par un, 
une memoire de donnees de sortie (440) dont 
I'adresse d'acces est designee par ledit signal 
de correspondance, et 

des moyens (450) de maintien de donnees 35 
dont I'acces a ete un echec pour maintenir des 
donnees qui sont a emettre dans lesdits 
moyens de sortie de resultat de recherche 
dans le cas ou on n'a pas en acces a ladite 
memoire de donnees de sortie, 40 
dans lequel lesdits moyens d'entree/sortie 
(300) comportent des moyens de sortie de 
resultat de recherche (340) pour emettre le 
contenu de ladite memoire de donnees de sor- 
tie a travers ledit bus de donnees, et 45 
dans lequel lesdits moyens de maintien de con- 
dition (810) maintiennent une condition interne 
dudit dispositif de recherche de chaTne de sym- 
boles. 

50 

Dispositif selon la revendication 6, dans lequel les- 
dits moyens d'entree (300) comporte un premier 
groupe de moyens d'entree (310, 320) ayant un tra- 
jet de donnees allant jusqu'a ladite memoire a con- 
tenu adressable (500) et un second groupe de 55 
moyens d'entree (330, 340) n'ayant aucun trajet de 
donnees allant jusqu'a ladite memoire a contenu 
adressable, et lesdits moyens d'entree decalent 



successivement des donnees entrees dudit second 
groupe de moyens d'entree dans ledit premier 
groupe de moyens d'entree tout en les faisant pas- 
ser a travers chacun des moyens d'entree de sorte 
que la chaTne de symboles a etudier qui est memo- 
risee dans ledit premier groupe de moyens d'entree 
est transferee dans ladite memoire a contenu 
adressable. 

8. Dispositif selon la revendication 7, dans lequel la 
quantite de decalage de la chaTne de symboles a 
etudier dans lesdits groupes de moyens d'entree 
(310 a 340) est une quantite selectionnee parmi un 
bit, un octet et deux octets. 

9. Dispositif selon la revendication 7 ou 8, dans lequel 
la quantite de decalage de la chaTne de symboles a 
etudier dans lesdits groupes de moyens d'entree 
(310 a 340) est designee par une instruction etablie 
dans lesdits moyens de definition de fbnction (820). 

10. Dispositif selon la revendication 7, dans lequel, 
lorsque la chaTne de symboles a etudier qui est 
memorisee dans ledit premier groupe de moyens 
d'entree (310, 320) est transferee dans ladite 
memoire a contenu adressable (500), I'agencement 
de donnees est change en unites d'un octet. 

11. Dispositif selon la revendication 10, dans lequel la 
maniere par laquelle le changement de I'agence- 
ment des donnees de la chaTne de symboles a etu- 
dier qui est memorisee dans ledit premier groupe 
de moyens d'entree (310, 320), est designee par 
une instruction etablie dans lesdits moyens de defi- 
nition de fbnction (820). 

12. Dispositif selon Tune quelconque des revendica- 
tions 6 a 1 1 , dans lequel, lorsque la chaTne de sym- 
boles a etudier est ecrite dans lesdits moyens 
d'entree (310 a 340), une sequence de traitements 
de recherche est mise en oeuvre conformement a 
I'etablissement par lesdits moyens de definition de 
fonction (820) et selon que des donnees memori- 
sees dans ladite memoire de donnees de sortie 
(440) ou des donnees memorisees dans lesdits 
moyens (450) de maintien de donnees dont I'acces 
a ete un echec sont emises dans lesdits moyens de 
sortie de resultat de recherche (340). 

13. Dispositif selon la revendication 12, dans lequel, 
lorsque la chaTne de symboles a etudier est ecrite, 
des moyens d'entree pour mettre en oeuvre la 
sequence de traitements de recherche etablie dans 
lesdits moyens de definition de fbnction (820), sont 
designes par une instruction etablie dans lesdits 
moyens de definition de fbnction. 

14. Dispositif selon Tune quelconque des revendica- 
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tions 6 a 13, dans lequel la condition de comparai- 
son entre la chaTne de symboles a etudier et la 
chatne de symboles a trouver est etablie en unites 
de un mot pour la chaine de symboles a etudier qui 
est memorisee dans ladite memoire a contenu 5 
adressable (500), et dans lequel ladite condition de 
comparaison inclut une condition sous laquelle la 
correspondance est determinee dans le cas ou la 
chame de symboles a etudier et la chaine de sym- 
boles a trouver sont egales Tune a I'autre, une con- w 
dition sous laquelle la correspondance est 
determinee dans le cas ou elles ne sont pas egales 
Tune a I'autre, une condition laquelle la correspon- 
dance est toujours determinee quelle que soit la 
relation entre celles-ci, et une condition sous 15 
laquelle une non-correspondance est toujours 
determinee quelle que soit la relation entre celles- 
ci. 

15. Dispositif selon Tune quelconque des revendica- 20 
tions 6 a 14, comportant de plus une borne de sor- 
tie ou des moyens d'interface pour transferer 
directement le contenu desdits moyens de maintien 

de condition (810) dans ladite CPU (100). 

25 

16. Dispositif selon Tune quelconque des revendica- 
tions 6 a 15, dans lequel, dans le cas ou la corres- 
pondance entre la chaTne de symboles a etudier et 
la chaTne de symboles a trouver est detectee, un 
signal de demande d'interruption est emis. 30 



17; Dispositif selon la revendication 16, dans lequel le 
fait que le signal de demande d'interruption doit 
etre emis ou non dans le cas ou la correspondance 
entre la chaine de symboles a etudier et la chaine 35 
de symboles a trouver est detectee, est designe par 
une instruction etablie dans lesdits moyens de defi- 
nition de fonction (820). 

18. Dispositif selon la revendication 16, dans lequel, 40 
lorsque le contenu de ladite memoire de donnees 

de sortie (440) est emis dans lesdits moyens de 
sortie de resultat de recherche (340) conformement 
a un signal representatif de la correspondance, le 
fait qu'une partie dudit contenu doit etre substitute 45 
ou non a des informations internes du dispositif de 
recherche de chame de symboles est designe par 
une instruction etablie dans lesdits moyens de defi- 
nition de fonction (820). 

50 

19. Dispositif selon Tune quelconque des revendica- 
tions 6 a 18, dans lequel un masque est etabli a la 
meme position binaire en unites d'un octet ou de 
deux octets pour chaque chaine de symboles a 
trouver qui est memorisee dans ladite memoire a 55 
contenu adressable (500). 

20. Dispositif selon la revendication 19, dans lequel la 



meme position binaire a laquelle le masque est eta- 
bli en unites d'un octet ou de deux octets pour cha- 
que chaTne de symboles a trouver qui est 
memorisee dans ladite memoire a contenu adres- 
sable (500), est designee par une instruction eta- 
blie dans lesdits moyens de definition de fonction 
(820). 

21. Dispositif selon Tune quelconque des revendica- 
tions 6 a 20, dans lequel on a acces aux contenus 
de ladite memoire a contenu adressable (500), de 
ladite memoire de condition de comparaison (600) 
et de ladite memoire de donnees de sortie (440) 
depuis ladite CPU (100). 

22. Dispositif selon la revendication 1 , dans lequel les- 
dits moyens de definition de fonction (203) compor- 
tent un premier registre (FDR) dans lequel ladite 
instruction est etablie ; lesdits moyens d'entree/sor- 
tie de donnees comportent un deuxieme registre 
(DIR2) dans lequel les donnees provenant de ladite 
CPU sont etablies via ledit bus d'adresses, et un 
troisieme registre (DOR) dans lequel le resultat du 
traitement de donnees est etabli, et 

lesdits premier, deuxieme et troisieme registres 
sont mappes en terme d'adresse dans un espace 
d'adresses. 

23. Dispositif selon les revendications 1 a 22, dans 
lequel des instructions a envoyer dans lesdits 
moyens de definition de fonction (203) depuis ladite 
CPU (100) comportent une instruction dirigeant la 
recherche pour des donnees d'un octet et une ins- 
truction dirigeant la recherche pour des donnees de 
deux octets. 
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